Overview
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 74.3 KiB |
| Average record size in memory | 76.1 B |
Variable types
| Text | 2 |
|---|---|
| Categorical | 3 |
| Numeric | 5 |
| DateTime | 1 |
amount is highly overall correlated with log_amount and 1 other fields | High correlation |
log_amount is highly overall correlated with amount and 1 other fields | High correlation |
sqrt_amount is highly overall correlated with amount and 1 other fields | High correlation |
transaction_id has unique values | Unique |
Reproduction
| Analysis started | 2026-02-21 12:32:55.632493 |
|---|---|
| Analysis finished | 2026-02-21 12:33:05.463609 |
| Duration | 9.83 seconds |
| Software version | ydata-profiling vv4.18.1 |
| Download configuration | config.json |
Variables
transaction_id
Text
Unique
| Distinct | 1000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | T000001 |
|---|---|
| 2nd row | T000002 |
| 3rd row | T000003 |
| 4th row | T000004 |
| 5th row | T000005 |
| Value | Count | Frequency (%) |
| t000001 | 1 | 0.1% |
| t000002 | 1 | 0.1% |
| t000003 | 1 | 0.1% |
| t000004 | 1 | 0.1% |
| t000005 | 1 | 0.1% |
| t000006 | 1 | 0.1% |
| t000007 | 1 | 0.1% |
| t000008 | 1 | 0.1% |
| t000009 | 1 | 0.1% |
| t000010 | 1 | 0.1% |
| Other values (990) | 990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3299 | |
| T | 1000 | 14.3% |
| 1 | 301 | 4.3% |
| 2 | 300 | 4.3% |
| 3 | 300 | 4.3% |
| 4 | 300 | 4.3% |
| 5 | 300 | 4.3% |
| 6 | 300 | 4.3% |
| 7 | 300 | 4.3% |
| 8 | 300 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3299 | |
| T | 1000 | 14.3% |
| 1 | 301 | 4.3% |
| 2 | 300 | 4.3% |
| 3 | 300 | 4.3% |
| 4 | 300 | 4.3% |
| 5 | 300 | 4.3% |
| 6 | 300 | 4.3% |
| 7 | 300 | 4.3% |
| 8 | 300 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3299 | |
| T | 1000 | 14.3% |
| 1 | 301 | 4.3% |
| 2 | 300 | 4.3% |
| 3 | 300 | 4.3% |
| 4 | 300 | 4.3% |
| 5 | 300 | 4.3% |
| 6 | 300 | 4.3% |
| 7 | 300 | 4.3% |
| 8 | 300 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3299 | |
| T | 1000 | 14.3% |
| 1 | 301 | 4.3% |
| 2 | 300 | 4.3% |
| 3 | 300 | 4.3% |
| 4 | 300 | 4.3% |
| 5 | 300 | 4.3% |
| 6 | 300 | 4.3% |
| 7 | 300 | 4.3% |
| 8 | 300 | 4.3% |
user_id
Text
| Distinct | 200 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | U0024 |
|---|---|
| 2nd row | U0196 |
| 3rd row | U0196 |
| 4th row | U0133 |
| 5th row | U0047 |
| Value | Count | Frequency (%) |
| u0117 | 13 | 1.3% |
| u0147 | 12 | 1.2% |
| u0052 | 12 | 1.2% |
| u0024 | 11 | 1.1% |
| u0182 | 11 | 1.1% |
| u0116 | 11 | 1.1% |
| u0141 | 9 | 0.9% |
| u0028 | 9 | 0.9% |
| u0150 | 9 | 0.9% |
| u0030 | 9 | 0.9% |
| Other values (190) | 894 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1671 | |
| U | 1000 | |
| 1 | 720 | |
| 2 | 222 | 4.4% |
| 4 | 209 | 4.2% |
| 7 | 207 | 4.1% |
| 3 | 203 | 4.1% |
| 8 | 200 | 4.0% |
| 5 | 194 | 3.9% |
| 6 | 190 | 3.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1671 | |
| U | 1000 | |
| 1 | 720 | |
| 2 | 222 | 4.4% |
| 4 | 209 | 4.2% |
| 7 | 207 | 4.1% |
| 3 | 203 | 4.1% |
| 8 | 200 | 4.0% |
| 5 | 194 | 3.9% |
| 6 | 190 | 3.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1671 | |
| U | 1000 | |
| 1 | 720 | |
| 2 | 222 | 4.4% |
| 4 | 209 | 4.2% |
| 7 | 207 | 4.1% |
| 3 | 203 | 4.1% |
| 8 | 200 | 4.0% |
| 5 | 194 | 3.9% |
| 6 | 190 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1671 | |
| U | 1000 | |
| 1 | 720 | |
| 2 | 222 | 4.4% |
| 4 | 209 | 4.2% |
| 7 | 207 | 4.1% |
| 3 | 203 | 4.1% |
| 8 | 200 | 4.0% |
| 5 | 194 | 3.9% |
| 6 | 190 | 3.8% |
product_id
Categorical
| Distinct | 50 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| P042 | 31 |
|---|---|
| P007 | 31 |
| P023 | 30 |
| P009 | 28 |
| P024 | 26 |
| Other values (45) |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P015 |
|---|---|
| 2nd row | P044 |
| 3rd row | P049 |
| 4th row | P042 |
| 5th row | P038 |
Common Values
| Value | Count | Frequency (%) |
| P042 | 31 | 3.1% |
| P007 | 31 | 3.1% |
| P023 | 30 | 3.0% |
| P009 | 28 | 2.8% |
| P024 | 26 | 2.6% |
| P008 | 25 | 2.5% |
| P029 | 25 | 2.5% |
| P037 | 25 | 2.5% |
| P006 | 24 | 2.4% |
| P049 | 24 | 2.4% |
| Other values (40) | 731 |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
| p042 | 31 | 3.1% |
| p007 | 31 | 3.1% |
| p023 | 30 | 3.0% |
| p009 | 28 | 2.8% |
| p024 | 26 | 2.6% |
| p008 | 25 | 2.5% |
| p029 | 25 | 2.5% |
| p037 | 25 | 2.5% |
| p006 | 24 | 2.4% |
| p049 | 24 | 2.4% |
| Other values (40) | 731 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1287 | |
| P | 1000 | |
| 4 | 320 | 8.0% |
| 2 | 313 | 7.8% |
| 3 | 296 | 7.4% |
| 1 | 265 | 6.6% |
| 9 | 119 | 3.0% |
| 5 | 106 | 2.6% |
| 7 | 103 | 2.6% |
| 8 | 97 | 2.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1287 | |
| P | 1000 | |
| 4 | 320 | 8.0% |
| 2 | 313 | 7.8% |
| 3 | 296 | 7.4% |
| 1 | 265 | 6.6% |
| 9 | 119 | 3.0% |
| 5 | 106 | 2.6% |
| 7 | 103 | 2.6% |
| 8 | 97 | 2.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1287 | |
| P | 1000 | |
| 4 | 320 | 8.0% |
| 2 | 313 | 7.8% |
| 3 | 296 | 7.4% |
| 1 | 265 | 6.6% |
| 9 | 119 | 3.0% |
| 5 | 106 | 2.6% |
| 7 | 103 | 2.6% |
| 8 | 97 | 2.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1287 | |
| P | 1000 | |
| 4 | 320 | 8.0% |
| 2 | 313 | 7.8% |
| 3 | 296 | 7.4% |
| 1 | 265 | 6.6% |
| 9 | 119 | 3.0% |
| 5 | 106 | 2.6% |
| 7 | 103 | 2.6% |
| 8 | 97 | 2.4% |
amount
Real number (ℝ)
High correlation
| Distinct | 851 |
|---|---|
| Distinct (%) | 85.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 59.93868 |
| Minimum | 19.53 |
|---|---|
| Maximum | 132.41 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 19.53 |
|---|---|
| 5-th percentile | 21.28 |
| Q1 | 37.745 |
| median | 56.39 |
| Q3 | 76.92 |
| 95-th percentile | 123.1815 |
| Maximum | 132.41 |
| Range | 112.88 |
| Interquartile range (IQR) | 39.175 |
Descriptive statistics
| Standard deviation | 29.013313 |
|---|---|
| Coefficient of variation (CV) | 0.48404992 |
| Kurtosis | -0.025561975 |
| Mean | 59.93868 |
| Median Absolute Deviation (MAD) | 19.355 |
| Skewness | 0.78538545 |
| Sum | 59938.68 |
| Variance | 841.77235 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 57 | 53 | 5.3% |
| 132.41 | 31 | 3.1% |
| 19.53 | 31 | 3.1% |
| 36.93 | 3 | 0.3% |
| 59.75 | 3 | 0.3% |
| 30.15 | 2 | 0.2% |
| 38.62 | 2 | 0.2% |
| 58.68 | 2 | 0.2% |
| 74.37 | 2 | 0.2% |
| 20.15 | 2 | 0.2% |
| Other values (841) | 869 |
| Value | Count | Frequency (%) |
| 19.53 | 31 | |
| 19.57 | 1 | 0.1% |
| 19.65 | 1 | 0.1% |
| 19.66 | 1 | 0.1% |
| 19.77 | 1 | 0.1% |
| 19.78 | 1 | 0.1% |
| 20.05 | 1 | 0.1% |
| 20.06 | 1 | 0.1% |
| 20.15 | 2 | 0.2% |
| 20.23 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 132.41 | 31 | |
| 132.32 | 1 | 0.1% |
| 130.59 | 1 | 0.1% |
| 130.41 | 1 | 0.1% |
| 130.08 | 1 | 0.1% |
| 129.83 | 1 | 0.1% |
| 129.66 | 1 | 0.1% |
| 129.64 | 1 | 0.1% |
| 129.55 | 1 | 0.1% |
| 128.03 | 1 | 0.1% |
payment_type
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Wallet | |
|---|---|
| UPI | |
| Cash | |
| Net Banking | |
| Debit Card |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 7.349 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Wallet |
|---|---|
| 2nd row | UPI |
| 3rd row | Debit Card |
| 4th row | Net Banking |
| 5th row | Net Banking |
Common Values
| Value | Count | Frequency (%) |
| Wallet | 180 | |
| UPI | 177 | |
| Cash | 168 | |
| Net Banking | 165 | |
| Debit Card | 159 | |
| Credit Card | 151 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| card | 310 | |
| wallet | 180 | |
| upi | 177 | |
| cash | 168 | |
| net | 165 | |
| banking | 165 | |
| debit | 159 | |
| credit | 151 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 823 | |
| e | 655 | 8.9% |
| t | 655 | 8.9% |
| C | 629 | 8.6% |
| 475 | 6.5% | |
| i | 475 | 6.5% |
| r | 461 | 6.3% |
| d | 461 | 6.3% |
| l | 360 | 4.9% |
| n | 330 | 4.5% |
| Other values (12) | 2025 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7349 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 823 | |
| e | 655 | 8.9% |
| t | 655 | 8.9% |
| C | 629 | 8.6% |
| 475 | 6.5% | |
| i | 475 | 6.5% |
| r | 461 | 6.3% |
| d | 461 | 6.3% |
| l | 360 | 4.9% |
| n | 330 | 4.5% |
| Other values (12) | 2025 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7349 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 823 | |
| e | 655 | 8.9% |
| t | 655 | 8.9% |
| C | 629 | 8.6% |
| 475 | 6.5% | |
| i | 475 | 6.5% |
| r | 461 | 6.3% |
| d | 461 | 6.3% |
| l | 360 | 4.9% |
| n | 330 | 4.5% |
| Other values (12) | 2025 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7349 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 823 | |
| e | 655 | 8.9% |
| t | 655 | 8.9% |
| C | 629 | 8.6% |
| 475 | 6.5% | |
| i | 475 | 6.5% |
| r | 461 | 6.3% |
| d | 461 | 6.3% |
| l | 360 | 4.9% |
| n | 330 | 4.5% |
| Other values (12) | 2025 |
date
Date
| Distinct | 636 |
|---|---|
| Distinct (%) | 63.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.9 KiB |
| Minimum | 2023-01-01 00:00:00 |
|---|---|
| Maximum | 2025-11-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Histogram with fixed size bins (bins=50)
day
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.566 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 15 |
| Q3 | 24 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 9.1066226 |
|---|---|
| Coefficient of variation (CV) | 0.58503293 |
| Kurtosis | -1.2923437 |
| Mean | 15.566 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.0039781737 |
| Sum | 15566 |
| Variance | 82.930575 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=31)
| Value | Count | Frequency (%) |
| 26 | 49 | 4.9% |
| 10 | 47 | 4.7% |
| 8 | 44 | 4.4% |
| 22 | 42 | 4.2% |
| 1 | 42 | 4.2% |
| 3 | 41 | 4.1% |
| 23 | 40 | 4.0% |
| 4 | 39 | 3.9% |
| 29 | 38 | 3.8% |
| 2 | 36 | 3.6% |
| Other values (21) | 582 |
| Value | Count | Frequency (%) |
| 1 | 42 | |
| 2 | 36 | |
| 3 | 41 | |
| 4 | 39 | |
| 5 | 29 | |
| 6 | 31 | |
| 7 | 20 | |
| 8 | 44 | |
| 9 | 31 | |
| 10 | 47 |
| Value | Count | Frequency (%) |
| 31 | 21 | |
| 30 | 25 | |
| 29 | 38 | |
| 28 | 26 | |
| 27 | 30 | |
| 26 | 49 | |
| 25 | 36 | |
| 24 | 29 | |
| 23 | 40 | |
| 22 | 42 |
month
Real number (ℝ)
| Distinct | 12 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.331 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.2814947 |
|---|---|
| Coefficient of variation (CV) | 0.5183217 |
| Kurtosis | -1.1373173 |
| Mean | 6.331 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.025847333 |
| Sum | 6331 |
| Variance | 10.768207 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=12)
| Value | Count | Frequency (%) |
| 5 | 103 | |
| 10 | 103 | |
| 6 | 97 | |
| 3 | 90 | |
| 8 | 89 | |
| 9 | 87 | |
| 4 | 84 | |
| 1 | 79 | |
| 2 | 76 | |
| 7 | 74 | |
| Other values (2) | 118 |
| Value | Count | Frequency (%) |
| 1 | 79 | |
| 2 | 76 | |
| 3 | 90 | |
| 4 | 84 | |
| 5 | 103 | |
| 6 | 97 | |
| 7 | 74 | |
| 8 | 89 | |
| 9 | 87 | |
| 10 | 103 |
| Value | Count | Frequency (%) |
| 12 | 56 | |
| 11 | 62 | |
| 10 | 103 | |
| 9 | 87 | |
| 8 | 89 | |
| 7 | 74 | |
| 6 | 97 | |
| 5 | 103 | |
| 4 | 84 | |
| 3 | 90 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2023 |
|---|---|
| 2nd row | 2023 |
| 3rd row | 2025 |
| 4th row | 2024 |
| 5th row | 2025 |
Common Values
| Value | Count | Frequency (%) |
| 2023 | 350 | |
| 2024 | 337 | |
| 2025 | 313 |
Length
Histogram of lengths of the category
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2023 | 350 | |
| 2024 | 337 | |
| 2025 | 313 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2000 | |
| 0 | 1000 | |
| 3 | 350 | 8.8% |
| 4 | 337 | 8.4% |
| 5 | 313 | 7.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2000 | |
| 0 | 1000 | |
| 3 | 350 | 8.8% |
| 4 | 337 | 8.4% |
| 5 | 313 | 7.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2000 | |
| 0 | 1000 | |
| 3 | 350 | 8.8% |
| 4 | 337 | 8.4% |
| 5 | 313 | 7.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 2000 | |
| 0 | 1000 | |
| 3 | 350 | 8.8% |
| 4 | 337 | 8.4% |
| 5 | 313 | 7.8% |
log_amount
Real number (ℝ)
High correlation
| Distinct | 942 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.053483 |
| Minimum | 2.7472709 |
|---|---|
| Maximum | 5.3933548 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 2.7472709 |
|---|---|
| 5-th percentile | 3.1036894 |
| Q1 | 3.6569987 |
| median | 4.0498695 |
| Q3 | 4.4300426 |
| 95-th percentile | 5.0484842 |
| Maximum | 5.3933548 |
| Range | 2.6460839 |
| Interquartile range (IQR) | 0.77304387 |
Descriptive statistics
| Standard deviation | 0.57500838 |
|---|---|
| Coefficient of variation (CV) | 0.14185538 |
| Kurtosis | -0.37510149 |
| Mean | 4.053483 |
| Median Absolute Deviation (MAD) | 0.38553924 |
| Skewness | 0.065426447 |
| Sum | 4053.483 |
| Variance | 0.33063463 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.747270914 | 11 | 1.1% |
| 5.393354782 | 11 | 1.1% |
| 4.106767082 | 3 | 0.3% |
| 3.635742356 | 3 | 0.3% |
| 3.051639905 | 2 | 0.2% |
| 3.666378189 | 2 | 0.2% |
| 4.150252194 | 2 | 0.2% |
| 4.322409318 | 2 | 0.2% |
| 4.003872659 | 2 | 0.2% |
| 3.173878459 | 2 | 0.2% |
| Other values (932) | 960 |
| Value | Count | Frequency (%) |
| 2.747270914 | 11 | |
| 2.772588722 | 1 | 0.1% |
| 2.789322921 | 1 | 0.1% |
| 2.813010637 | 1 | 0.1% |
| 2.817801065 | 1 | 0.1% |
| 2.820783471 | 1 | 0.1% |
| 2.835563521 | 1 | 0.1% |
| 2.836736542 | 1 | 0.1% |
| 2.837322537 | 1 | 0.1% |
| 2.844909384 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 5.393354782 | 11 | |
| 5.383944453 | 1 | 0.1% |
| 5.37920587 | 1 | 0.1% |
| 5.367096882 | 1 | 0.1% |
| 5.356208845 | 1 | 0.1% |
| 5.318806033 | 1 | 0.1% |
| 5.302956589 | 1 | 0.1% |
| 5.296916386 | 1 | 0.1% |
| 5.291393451 | 1 | 0.1% |
| 5.290436393 | 1 | 0.1% |
sqrt_amount
Real number (ℝ)
High correlation
| Distinct | 902 |
|---|---|
| Distinct (%) | 90.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.8179518 |
| Minimum | 4.419276 |
|---|---|
| Maximum | 13.120976 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 4.419276 |
|---|---|
| 5-th percentile | 4.613025 |
| Q1 | 6.1436911 |
| median | 7.5093265 |
| Q3 | 9.1068652 |
| 95-th percentile | 12.441315 |
| Maximum | 13.120976 |
| Range | 8.7016996 |
| Interquartile range (IQR) | 2.9631742 |
Descriptive statistics
| Standard deviation | 2.2264269 |
|---|---|
| Coefficient of variation (CV) | 0.28478391 |
| Kurtosis | -0.25677983 |
| Mean | 7.8179518 |
| Median Absolute Deviation (MAD) | 1.4583934 |
| Skewness | 0.60784747 |
| Sum | 7817.9518 |
| Variance | 4.9569768 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 13.12097557 | 31 | 3.1% |
| 4.419275959 | 31 | 3.1% |
| 7.729812417 | 3 | 0.3% |
| 6.077005842 | 3 | 0.3% |
| 4.488875137 | 2 | 0.2% |
| 8.274660114 | 2 | 0.2% |
| 8.623804265 | 2 | 0.2% |
| 5.490901565 | 2 | 0.2% |
| 7.90253124 | 2 | 0.2% |
| 6.709694479 | 2 | 0.2% |
| Other values (892) | 920 |
| Value | Count | Frequency (%) |
| 4.419275959 | 31 | |
| 4.423799272 | 1 | 0.1% |
| 4.432832052 | 1 | 0.1% |
| 4.433959855 | 1 | 0.1% |
| 4.446346815 | 1 | 0.1% |
| 4.447471192 | 1 | 0.1% |
| 4.477722635 | 1 | 0.1% |
| 4.478839135 | 1 | 0.1% |
| 4.488875137 | 2 | 0.2% |
| 4.497777229 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 13.12097557 | 31 | |
| 13.10267148 | 1 | 0.1% |
| 12.9953838 | 1 | 0.1% |
| 12.99230542 | 1 | 0.1% |
| 12.97536127 | 1 | 0.1% |
| 12.97266357 | 1 | 0.1% |
| 12.86740067 | 1 | 0.1% |
| 12.85846025 | 2 | 0.2% |
| 12.8 | 1 | 0.1% |
| 12.72713636 | 1 | 0.1% |
Interactions
Correlations
| amount | day | log_amount | month | payment_type | product_id | sqrt_amount | year | |
|---|---|---|---|---|---|---|---|---|
| amount | 1.000 | -0.012 | 0.931 | 0.013 | 0.061 | 0.045 | 0.931 | 0.041 |
| day | -0.012 | 1.000 | -0.013 | -0.016 | 0.028 | 0.000 | -0.013 | 0.037 |
| log_amount | 0.931 | -0.013 | 1.000 | 0.041 | 0.067 | 0.000 | 1.000 | 0.000 |
| month | 0.013 | -0.016 | 0.041 | 1.000 | 0.000 | 0.058 | 0.041 | 0.171 |
| payment_type | 0.061 | 0.028 | 0.067 | 0.000 | 1.000 | 0.073 | 0.058 | 0.000 |
| product_id | 0.045 | 0.000 | 0.000 | 0.058 | 0.073 | 1.000 | 0.000 | 0.101 |
| sqrt_amount | 0.931 | -0.013 | 1.000 | 0.041 | 0.058 | 0.000 | 1.000 | 0.000 |
| year | 0.041 | 0.037 | 0.000 | 0.171 | 0.000 | 0.101 | 0.000 | 1.000 |
Missing values
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
Sample
| transaction_id | user_id | product_id | amount | payment_type | date | day | month | year | log_amount | sqrt_amount | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | T000001 | U0024 | P015 | 67.67 | Wallet | 2023-02-12 | 12 | 2 | 2023 | 4.229312 | 8.226178 |
| 1 | T000002 | U0196 | P044 | 76.44 | UPI | 2023-03-24 | 24 | 3 | 2023 | 4.349503 | 8.742997 |
| 2 | T000003 | U0196 | P049 | 104.57 | Debit Card | 2025-08-21 | 21 | 8 | 2025 | 4.659374 | 10.225947 |
| 3 | T000004 | U0133 | P042 | 102.75 | Net Banking | 2024-07-23 | 23 | 7 | 2024 | 4.641984 | 10.136567 |
| 4 | T000005 | U0047 | P038 | 23.89 | Net Banking | 2025-10-04 | 4 | 10 | 2025 | 3.214466 | 4.887740 |
| 5 | T000006 | U0024 | P031 | 31.10 | UPI | 2025-04-16 | 16 | 4 | 2025 | 3.468856 | 5.576737 |
| 6 | T000007 | U0086 | P021 | 74.37 | Wallet | 2023-08-03 | 3 | 8 | 2023 | 4.322409 | 8.623804 |
| 7 | T000008 | U0042 | P022 | 74.31 | Net Banking | 2025-10-11 | 11 | 10 | 2025 | 4.321613 | 8.620325 |
| 8 | T000009 | U0074 | P043 | 74.37 | Net Banking | 2023-03-17 | 17 | 3 | 2023 | 4.322409 | 8.623804 |
| 9 | T000010 | U0117 | P006 | 57.00 | UPI | 2024-05-31 | 31 | 5 | 2024 | 5.393355 | 13.120976 |
| transaction_id | user_id | product_id | amount | payment_type | date | day | month | year | log_amount | sqrt_amount | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | T000991 | U0038 | P004 | 58.16 | Wallet | 2024-08-27 | 27 | 8 | 2024 | 4.080246 | 7.626270 |
| 991 | T000992 | U0053 | P049 | 24.52 | Cash | 2023-03-08 | 8 | 3 | 2023 | 3.239462 | 4.951767 |
| 992 | T000993 | U0092 | P029 | 38.06 | Credit Card | 2024-04-04 | 4 | 4 | 2024 | 3.665099 | 6.169279 |
| 993 | T000994 | U0191 | P042 | 66.15 | Cash | 2025-03-07 | 7 | 3 | 2025 | 4.206929 | 8.133265 |
| 994 | T000995 | U0061 | P033 | 20.99 | Wallet | 2024-08-12 | 12 | 8 | 2024 | 3.090588 | 4.581484 |
| 995 | T000996 | U0178 | P043 | 71.11 | Credit Card | 2025-02-20 | 20 | 2 | 2025 | 4.278193 | 8.432675 |
| 996 | T000997 | U0100 | P027 | 53.96 | Wallet | 2024-10-02 | 2 | 10 | 2024 | 4.006606 | 7.345747 |
| 997 | T000998 | U0142 | P004 | 76.06 | Credit Card | 2024-05-29 | 29 | 5 | 2024 | 4.344584 | 8.721238 |
| 998 | T000999 | U0052 | P040 | 62.45 | Net Banking | 2024-04-06 | 6 | 4 | 2024 | 4.150252 | 7.902531 |
| 999 | T001000 | U0163 | P046 | 123.78 | Wallet | 2025-01-23 | 23 | 1 | 2025 | 4.826552 | 11.125646 |